智能论文笔记

Are You Comfortable Now: Deep Learning the Temporal Variation in Thermal Comfort in Winters

Betty Lala , Srikant Manas Kala , Anmol Rastogi , Kunal Dahiya , Aya Hagishima

分类：机器学习 | 人工智能

2022-08-20

智能建筑中的室内热舒适对乘员的健康和表现有重大影响。因此，机器学习（ML）越来越多地用于解决与室内热舒适的挑战。热舒适感的时间变化是调节居住者福祉和能耗的重要问题。但是，在大多数基于ML的热舒适研究中，不考虑时间中的时间方面，例如一天中的时间，昼夜节律和室外温度。这项工作解决了这些问题。它研究了昼夜节律和室外温度对ML模型的预测准确性和分类性能的影响。数据是通过在14个教室中进行的长达一个月的实地实验收集的，其中512名小学生。四个热舒适度指标被认为是深神经网络的输出，并支持数据集的向量机模型。时间变异性对学童舒适性的影响通过“一天中的时间”分析显示。预测准确性的时间差异已显示（多达80％）。此外，我们表明室外温度（随时间变化）对热舒适模型的预测性能产生了积极影响高达30％。时空环境的重要性通过对比的是微观级别（特定于位置）和宏观级别（整个城市的6个位置）的重要性。这项工作的最重要发现是，对于多种热舒适度指标，显示了预测准确性的明确提高，而天空中的时间和天空照明则有所增加。

translated by 谷歌翻译

NGAME: Negative Mining-aware Mini-batching for Extreme Classification

Kunal Dahiya , Nilesh Gupta , Deepak Saini , Akshay Soni , Yajun Wang , Kushal Dave , Jian Jiao , Gururaj K , Prasenjit Dey , Amit Singh

分类：机器学习

2022-07-10

极端分类（XC）试图用最大的标签集中标记标签的子集标记数据点。通过使用稀疏，手工制作的功能的XC方法优越，用密集，学习的数据来进行深度XC，以数据点和标签的形式吸引了很多关注。负挖掘技术已成为所有深XC方法的关键组成部分，使它们可以扩展到数百万个标签。然而，尽管最近进步，但培训具有大型编码器体系结构（例如变形金刚）的深入XC模型仍然具有挑战性。本文确定，流行负面挖掘技术的内存通常迫使小型批量尺寸保持小且缓慢的训练。作为回应，本文介绍了Ngame，这是一种轻巧的迷你批次创建技术，可证明可证明准确的内部负面样品。这使得与现有负面采样技术相比，具有更大的迷你批次培训，提供更快的收敛性和更高的精度。发现Ngame的准确性比各种基准数据集的最先进方法要高16％，以进行极端分类，并且在回答搜索引擎查询以响应用户网页时检索搜索引擎查询更准确3％显示个性化广告。在流行搜索引擎的实时A/B测试中，Ngame在点击率率中的收益最高可达23％。

translated by 谷歌翻译

Building Matters: Spatial Variability in Machine Learning Based Thermal Comfort Prediction in Winters

Betty Lala , Srikant Manas Kala , Anmol Rastogi , Kunal Dahiya , Hirozumi Yamaguchi , Aya Hagishima

分类：机器学习

2022-06-28

室内环境中的热舒适感会对乘员的健康，福祉和表现产生巨大影响。鉴于对能源效率和实现智能建筑的关注，机器学习（ML）越来越多地用于数据驱动的热舒适度（TC）预测。通常，提出了用于空调或HVAC通风建筑物的基于ML的解决方案，这些模型主要是为成年人设计的。另一方面，在大多数国家 /地区，自然通风（NV）的建筑物是常态。它们也是节能和长期可持续性目标的理想选择。但是，NV建筑物的室内环境缺乏热调节，并且在空间环境中差异很大。这些因素使TC预测极具挑战性。因此，确定建筑环境对TC模型性能的影响很重要。此外，需要研究跨不同NV室内空间的TC预测模型的概括能力。这项工作解决了这些问题。数据是通过在5个自然通风的学校建筑中进行的为期一个月的实地实验，涉及512名小学生。空间变异性对学生舒适度的影响通过预测准确性的变化（高达71％）来证明。还通过特征重要性的变化来证明建筑环境对TC预测的影响。此外，对儿童（我们的数据集）和成人（ASHRAE-II数据库）进行了模型性能的空间变异性比较分析。最后，评估了NV教室中热舒适模型的概括能力，并强调了主要挑战。

translated by 谷歌翻译

Optimizing Unlicensed Coexistence Network Performance Through Data Learning

Srikant Manas Kala , Vanlin Sathya , Kunal Dahiya , Teruo Higashino , Hirozumi Yamaguchi

分类：机器学习

2021-11-15

未经许可的LTE-WiFi共存网络正在经历一致的致密化，以满足上升的移动数据需求。随着共存网络复杂性的增加，研究网络功能关系（NFR）非常重要，并利用它们来优化密集的共存网络性能。这项工作通过监督从现实世界实验中收集的网络数据的监督学习来研究未经许可的LTE-WiFi（LTE-U和LTE-LAA）网络中的NFR。在实验中考虑不同的802.11标准和不同的通道带宽，并且精确概述了学习模型选择策略。此后，通过学习模型参数如R-SQ，残差，异常值，预测器的选择等进行不同LTE-WiFi网络配置的比较分析。此外，提出了一种基于网络特征关系的优化（NEFRO）框架。通过利用从网络数据中学到的特征关系方程，NEFRO改善了传统的优化制剂。它被证明是通过两个优化目标，VIZ的时间关键密集共存网络非常适合。，网络容量和信号强度。 NEFRO针对四个关于网络优化的工作验证。 NEFRO成功地将优化收敛时间降低多达24％，同时平均保持高达97.16％的精度。

translated by 谷歌翻译

DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents

Kunal Dahiya , Deepak Saini , Anshul Mittal , Ankush Shaw , Kushal Dave , Akshay Soni , Himanshu Jain , Sumeet Agarwal , Manik Varma

分类：机器学习

2021-11-12

可扩展性和准确性在深度极端多标签学习中得到了很好的认可挑战，其中目标是培训架构，以便自动注释具有来自极大的标签集的最相关标签子集的数据点。本文通过将深度极端多标签任务分解为四个更简单的子任务，开发了解决这些挑战的DeepXML框架，每个挑战可以准确且有效地培训。为四个子任务选择不同的组件允许DeepXML生成一个算法系列，在准确性和可扩展性之间产生不同的权衡。特别是，DeepXML产生了ASTEC算法，可以比公开可用的短文本数据集上的领先深度极端分类器更准确，5-30倍更快地进行5-30倍。 ASTEC还可以有效地在Bing短文本数据集上培训，该数据集包含多达6200万个标签，同时在商品硬件上进行数十亿用户和数据点的预测。这允许ASTEC部署在Bing搜索引擎上，以获取许多短文本应用程序，范围从匹配用户查询到广告商出价短语，以显示个性化广告，其中它在点击率，覆盖范围，收入和其他在线指标中产生了显着的收益目前在生产中的最先进技术。 Deepxml的代码可在https://github.com/extreme-classification/deepxml上获得

translated by 谷歌翻译

Self-Supervised Object Segmentation with a Cut-and-Pasting GAN

Kunal Chaturvedi , Ali Braytee , Jun Li , Mukesh Prasad

分类：计算机视觉 | 机器学习

2023-01-01

This paper proposes a novel self-supervised based Cut-and-Paste GAN to perform foreground object segmentation and generate realistic composite images without manual annotations. We accomplish this goal by a simple yet effective self-supervised approach coupled with the U-Net based discriminator. The proposed method extends the ability of the standard discriminators to learn not only the global data representations via classification (real/fake) but also learn semantic and structural information through pseudo labels created using the self-supervised task. The proposed method empowers the generator to create meaningful masks by forcing it to learn informative per-pixel as well as global image feedback from the discriminator. Our experiments demonstrate that our proposed method significantly outperforms the state-of-the-art methods on the standard benchmark datasets.

translated by 谷歌翻译

Concentration of the Langevin Algorithm's Stationary Distribution

Jason M. Altschuler , Kunal Talwar

分类： (统计)机器学习 | 机器学习

2022-12-24

A canonical algorithm for log-concave sampling is the Langevin Algorithm, aka the Langevin Diffusion run with some discretization stepsize $\eta > 0$. This discretization leads the Langevin Algorithm to have a stationary distribution $\pi_{\eta}$ which differs from the stationary distribution $\pi$ of the Langevin Diffusion, and it is an important challenge to understand whether the well-known properties of $\pi$ extend to $\pi_{\eta}$. In particular, while concentration properties such as isoperimetry and rapidly decaying tails are classically known for $\pi$, the analogous properties for $\pi_{\eta}$ are open questions with direct algorithmic implications. This note provides a first step in this direction by establishing concentration results for $\pi_{\eta}$ that mirror classical results for $\pi$. Specifically, we show that for any nontrivial stepsize $\eta > 0$, $\pi_{\eta}$ is sub-exponential (respectively, sub-Gaussian) when the potential is convex (respectively, strongly convex). Moreover, the concentration bounds we show are essentially tight. Key to our analysis is the use of a rotation-invariant moment generating function (aka Bessel function) to study the stationary dynamics of the Langevin Algorithm. This technique may be of independent interest because it enables directly analyzing the discrete-time stationary distribution $\pi_{\eta}$ without going through the continuous-time stationary distribution $\pi$ as an intermediary.

translated by 谷歌翻译

Cross-Domain Consumer Review Analysis

Aditya Pandey , Kunal Joshi

分类：机器学习

2022-12-23

The paper presents a cross-domain review analysis on four popular review datasets: Amazon, Yelp, Steam, IMDb. The analysis is performed using Hadoop and Spark, which allows for efficient and scalable processing of large datasets. By examining close to 12 million reviews from these four online forums, we hope to uncover interesting trends in sales and customer sentiment over the years. Our analysis will include a study of the number of reviews and their distribution over time, as well as an examination of the relationship between various review attributes such as upvotes, creation time, rating, and sentiment. By comparing the reviews across different domains, we hope to gain insight into the factors that drive customer satisfaction and engagement in different product categories.

translated by 谷歌翻译

Task Ambiguity in Humans and Language Models

Alex Tamkin , Kunal Handa , Avash Shrestha , Noah Goodman

分类：自然语言处理 | 机器学习

2022-12-20

Language models have recently achieved strong performance across a wide range of NLP benchmarks. However, unlike benchmarks, real world tasks are often poorly specified, and agents must deduce the user's intended behavior from a combination of context, instructions, and examples. We investigate how both humans and models behave in the face of such task ambiguity by proposing AmbiBench, a new benchmark of six ambiguously-specified classification tasks. We evaluate humans and models on AmbiBench by seeing how well they identify the intended task using 1) instructions with varying degrees of ambiguity, and 2) different numbers of labeled examples. We find that the combination of model scaling (to 175B parameters) and training with human feedback data enables models to approach or exceed the accuracy of human participants across tasks, but that either one alone is not sufficient. In addition, we show how to dramatically improve the accuracy of language models trained without large-scale human feedback training by finetuning on a small number of ambiguous in-context examples, providing a promising direction for teaching models to generalize well in the face of ambiguity.

translated by 谷歌翻译

Metaheuristic for Hub-Spoke Facility Location Problem: Application to Indian E-commerce Industry

Aakash Sachdeva , Bhupinder Singh , Rahul Prasad , Nakshatra Goel , Ronit Mondal , Jatin Munjal , Abhishek Bhatnagar , Manjeet Dahiya

分类：机器学习

2022-12-16

Indian e-commerce industry has evolved over the last decade and is expected to grow over the next few years. The focus has now shifted to turnaround time (TAT) due to the emergence of many third-party logistics providers and higher customer expectations. The key consideration for delivery providers is to balance their overall operating costs while meeting the promised TAT to their customers. E-commerce delivery partners operate through a network of facilities whose strategic locations help to run the operations efficiently. In this work, we identify the locations of hubs throughout the country and their corresponding mapping with the distribution centers. The objective is to minimize the total network costs with TAT adherence. We use Genetic Algorithm and leverage business constraints to reduce the solution search space and hence the solution time. The results indicate an improvement of 9.73% in TAT compliance compared with the current scenario.

translated by 谷歌翻译